Compressed Pattern Matching in the Annotated Streaming Model
Abstract
The annotated streaming model was originally introduced by Chakrabarti, Cormode and McGregor [ICALP 09]. In this extension of the conventional streaming model, a client must answer a single query on the stream with help from an untrusted annotator, which provides an annotated data stream to be used alongside the input stream. Only one-way communication is allowed. We extend the model by considering multiple queries, and focus on ongoing queries, where a new output must be given every time an input item arrives. In this model, we show the existence of a data structure that enables us to store and recover information about past items in the stream using very little space on the client. We first use this technique to give a space-annotation trade-off for the annotated multi-indexing problem, which is a natural generalisation of the previously studied annotated indexing problem. Our main result is a space-annotation trade-off for the classic exact pattern matching problem in phrase-compressed strings. In particular, we show the existence of a solution that takes O(log n) time per phrase and O(log n + m) client space, and uses O(log n) words of annotation per phrase. If the client space is increased to O(n + m), then O(1) words of annotation and O(1) time per phrase suffice. Here n is the length of the stream and m is the length of the pattern. Our result also holds for the well-known LZ78 compression scheme, which is a special case of phrase-compression. All of the problems we consider have Ω(n) randomised space lower bounds in the standard (unannotated) streaming model.
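To make the notion of a phrase-compressed stream concrete, the following is a minimal sketch of textbook LZ78 parsing, in which every phrase is a previously seen phrase extended by one character. It illustrates only the input format; it is not the annotated-streaming algorithm from the abstract. The function names `lz78_parse` and `lz78_decode` are hypothetical and chosen for this illustration.

```python
def lz78_parse(text):
    """Split text into LZ78 phrases, each encoded as (previous_phrase_index, char).

    Index 0 denotes the empty phrase. Hypothetical helper for illustration only.
    """
    dictionary = {"": 0}   # phrase -> index
    phrases = []
    current = ""
    for ch in text:
        if current + ch in dictionary:
            current += ch                      # keep extending the known phrase
        else:
            phrases.append((dictionary[current], ch))
            dictionary[current + ch] = len(dictionary)
            current = ""
    if current:
        # Leftover input equals an existing phrase; emit it as prefix + last char.
        phrases.append((dictionary[current[:-1]], current[-1]))
    return phrases


def lz78_decode(phrases):
    """Rebuild the original text from (previous_phrase_index, char) pairs."""
    table = [""]           # table[i] is the text of phrase i
    out = []
    for index, ch in phrases:
        phrase = table[index] + ch
        table.append(phrase)
        out.append(phrase)
    return "".join(out)


if __name__ == "__main__":
    encoded = lz78_parse("abababa")
    print(encoded)
    print(lz78_decode(encoded))
```

In the setting described above, the client would receive such phrases one at a time and would have to report pattern occurrences without storing the whole phrase table, which is where the annotator's help comes in.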
Similar resources
Pattern Matching Machine for Text Compressed Using Finite State Model
The classical pattern matching problem is to find all occurrences of patterns in a text. In many practical cases, since the text is very large and stored in secondary storage, most of the time for pattern matching is dominated by data transmission of the text. Therefore text compression can speed up the pattern matching. In this framework it is required to develop an efficient pattern m...
Pattern-Matching Problems for
The power of weighted finite automata to describe very complex images was widely studied, see [5, 6, 7]. Finite automata can also be used as an effective tool for compression of two-dimensional images. There are some software packages using this type of compression, see [12, 6]. We consider the complexity of some pattern-matching problems for two-dimensional images which are highly compressed using...
Multiple Pattern Matching Algorithms on Collage System
Compressed pattern matching is one of the most active topics in string matching. The goal is to find all occurrences of a pattern in a compressed text without decompression. Various algorithms have been proposed depending on underlying compression methods in the last decade. Although some algorithms for multipattern searching on compressed text were also presented very recently, all of them are...
The Complexity of Two-dimensional Compressed Pattern Matching
We study the computational complexity of two-dimensional compressed pattern matching problems. Among other things, we design an efficient randomized algorithm for the equality problem of two compressed two-dimensional patterns, and prove computational hardness of general two-dimensional compressed pattern matching.
The Complexity of Two-Dimensional Compressed Pattern-
We consider the complexity of problems for highly compressed 2-dimensional texts: compressed pattern-matching (when the pattern is not compressed and the text is compressed) and fully compressed pattern-matching (when the pattern is also compressed). First we consider 2-dimensional compression in terms of straight-line programs, see [9]. It is a natural way for representing very highly compresse...
Publication date: 2015